Heading

Introduction

This dataset, sourced from vgchartz.com, offers a wealth of insights into the dynamics between platform and genre for the top 100 video games worldwide. Observe which platforms are driving global sales, what genres have been most successful in different regions across the world, and how both of these factors have changed over time. Analyze this data to inform your understanding of the gaming industry and discover trends propelling game developers to success.

Description of Data

The data contains the following columns and data types:

Descriptive Statistics of Data

NA_Sales:

# A tibble: 1 × 9
  Length Minimum Maximum  Mean Mode    Median    SD Kurtosis Skewness
   <int>   <dbl>   <dbl> <dbl> <chr>    <dbl> <dbl>    <dbl>    <dbl>
1  16598       0    41.5 0.265 numeric   0.08 0.817     652.     18.8

Global_Sales:

# A tibble: 1 × 9
  Length Minimum Maximum  Mean Mode    Median    SD Kurtosis Skewness
   <int>   <dbl>   <dbl> <dbl> <chr>    <dbl> <dbl>    <dbl>    <dbl>
1  16598    0.01    82.7 0.537 numeric   0.17  1.56     607.     17.4
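
The report does not show how the columns of these one-row tables were computed, so the following is only a minimal Python sketch of one common way to obtain them. The function name `describe` and the toy values are hypothetical, and the moment-based skewness and raw kurtosis used here are an assumption: other definitions (for example excess kurtosis, or bias-corrected estimators) give different numbers. The `Mode` entry `numeric` in the tables appears to be the storage mode of the column rather than its most frequent value, so it is not reproduced here.

```python
import statistics


def describe(values):
    """Return basic descriptive statistics for a list of numbers."""
    n = len(values)
    mean = statistics.fmean(values)
    # Central moments with an n denominator, used for the shape measures.
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return {
        "length": n,
        "minimum": min(values),
        "maximum": max(values),
        "mean": mean,
        "median": statistics.median(values),
        "sd": statistics.stdev(values),  # sample SD (n - 1 denominator)
        "skewness": m3 / m2 ** 1.5,      # assumed definition: g1 = m3 / m2^(3/2)
        "kurtosis": m4 / m2 ** 2,        # assumed definition: raw kurtosis (normal = 3)
    }


# Hypothetical, made-up sales figures in millions of units (not from the dataset).
toy_sales = [0.01, 0.02, 0.05, 0.08, 0.10, 0.25, 0.40, 5.00]
toy_stats = describe(toy_sales)

# A symmetric sample for comparison: its skewness is zero.
symmetric_stats = describe([1, 2, 3, 4, 5])
```

A single very large value, as in `toy_sales`, is enough to push the mean well above the median and produce a large positive skewness, which is the same pattern the tables show for NA_Sales and Global_Sales.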

Descriptive Statistics

      Rank           Year              NA_Sales          EU_Sales      
 Min.   :    1   Length:16598       Min.   : 0.0000   Min.   : 0.0000  
 1st Qu.: 4151   Class :character   1st Qu.: 0.0000   1st Qu.: 0.0000  
 Median : 8300   Mode  :character   Median : 0.0800   Median : 0.0200  
 Mean   : 8301                      Mean   : 0.2647   Mean   : 0.1467  
 3rd Qu.:12450                      3rd Qu.: 0.2400   3rd Qu.: 0.1100  
 Max.   :16600                      Max.   :41.4900   Max.   :29.0200  
    JP_Sales         Other_Sales        Global_Sales    
 Min.   : 0.00000   Min.   : 0.00000   Min.   : 0.0100  
 1st Qu.: 0.00000   1st Qu.: 0.00000   1st Qu.: 0.0600  
 Median : 0.00000   Median : 0.01000   Median : 0.1700  
 Mean   : 0.07778   Mean   : 0.04806   Mean   : 0.5374  
 3rd Qu.: 0.04000   3rd Qu.: 0.04000   3rd Qu.: 0.4700  
 Max.   :10.22000   Max.   :10.57000   Max.   :82.7400  
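
The summary reports Year as a character column, and the regression output includes a `YearN/A` level, which suggests that missing years are stored as the literal string "N/A". One possible way to handle this before treating Year as a number is sketched below in Python; the function name `parse_year`, the set of missing markers, and the sample values are hypothetical.

```python
def parse_year(raw):
    """Convert a year string to int, returning None for missing markers."""
    text = raw.strip()
    if text in {"", "N/A", "NA"}:  # assumed spellings of a missing year
        return None
    return int(text)


# Hypothetical sample of raw Year strings (not taken from the dataset).
raw_years = ["2006", "1985", "N/A", "2008"]
years = [parse_year(y) for y in raw_years]

# Keep only the rows whose year is known before computing numeric summaries.
known_years = [y for y in years if y is not None]
earliest, latest = min(known_years), max(known_years)
```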

Inferential Statistics

T-Test


    One Sample t-test

data:  df$NA_Sales
t = 41.752, df = 16597, p-value < 2.2e-16
alternative hypothesis: true mean is not equal to 0
95 percent confidence interval:
 0.2522422 0.2770927
sample estimates:
mean of x 
0.2646674 
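
As a consistency check, the reported values can be tied together with the one-sample formula t = mean / (SD / sqrt(n)). The Python sketch below works only from the numbers printed above; it uses the normal quantile in place of the t quantile, which is an approximation that is very close at 16597 degrees of freedom. Because sales cannot be negative, a null hypothesis of a zero mean is rejected almost by construction, so the confidence interval is the more informative part of this output.

```python
from statistics import NormalDist

# Values copied from the t-test output above.
n = 16598
mean = 0.2646674
t_stat = 41.752

# Standard error implied by t = mean / se (the null value is 0).
se = mean / t_stat

# 95% interval using the normal quantile as an approximation to the t quantile.
z = NormalDist().inv_cdf(0.975)
ci_low = mean - z * se
ci_high = mean + z * se

# Standard deviation implied by se = sd / sqrt(n).
implied_sd = se * n ** 0.5

print(f"se={se:.6f}  CI=({ci_low:.5f}, {ci_high:.5f})  sd={implied_sd:.3f}")
```

The recovered interval agrees with the reported one to about five decimal places, and the implied standard deviation agrees with the SD of 0.817 listed for NA_Sales.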

Chi-Square Test


    Pearson's Chi-squared test

data:  cross_table
X-squared = 3416768, df = 253776, p-value < 2.2e-16
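
The output does not say which two variables make up `cross_table`, so the Python sketch below only illustrates how the Pearson statistic and its degrees of freedom, (rows - 1) x (columns - 1), are computed. The function name `chi_square` and the 2 x 2 counts are hypothetical. The last lines assume the cross table covers all 16598 rows: since a table always has more cells than degrees of freedom, the average expected count per cell is then below 16598 / 253776, far under the usual rule of thumb of 5, so the chi-squared approximation may be unreliable here.

```python
def chi_square(table):
    """Return (statistic, degrees of freedom) for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Hypothetical 2 x 2 table of counts (not from the dataset).
toy_table = [[10, 20], [30, 40]]
toy_stat, toy_dof = chi_square(toy_table)

# Upper bound on the average expected cell count for the reported test,
# assuming all 16598 observations are in the cross table.
n_obs = 16598
reported_dof = 253776
mean_expected_upper_bound = n_obs / reported_dof
```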

ANOVA

               Df Sum Sq Mean Sq F value Pr(>F)    
Genre          11    479   43.56   18.22 <2e-16 ***
Residuals   16586  39654    2.39                   
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
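
The columns of the ANOVA table are related by simple arithmetic: each mean square is the sum of squares divided by its degrees of freedom, and F is the ratio of the two mean squares. The Python sketch below recomputes them from the rounded values printed above, so small differences in the last digit are expected. The eta-squared effect size is not part of the original output and is added here only as an illustration of how much of the total variation Genre accounts for.

```python
# Values copied from the ANOVA table above (already rounded in the output).
df_genre, ss_genre = 11, 479
df_resid, ss_resid = 16586, 39654

# Mean square = sum of squares / degrees of freedom.
ms_genre = ss_genre / df_genre
ms_resid = ss_resid / df_resid

# F statistic = between-group mean square / residual mean square.
f_value = ms_genre / ms_resid

# Eta-squared: share of the total sum of squares attributed to Genre.
eta_squared = ss_genre / (ss_genre + ss_resid)

print(f"MS genre={ms_genre:.2f}  MS resid={ms_resid:.2f}  "
      f"F={f_value:.2f}  eta^2={eta_squared:.4f}")
```

Although the F test is highly significant, Genre accounts for only about 1.2% of the total sum of squares, so significance here mostly reflects the large sample size.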

Correlation Matrix

              NA_Sales  EU_Sales  JP_Sales Other_Sales Global_Sales
NA_Sales     1.0000000 0.7677267 0.4497874   0.6347373    0.9410474
EU_Sales     0.7677267 1.0000000 0.4355845   0.7263849    0.9028358
JP_Sales     0.4497874 0.4355845 1.0000000   0.2901862    0.6118155
Other_Sales  0.6347373 0.7263849 0.2901862   1.0000000    0.7483308
Global_Sales 0.9410474 0.9028358 0.6118155   0.7483308    1.0000000
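
Each entry of the matrix is a Pearson correlation coefficient. The Python sketch below shows the computation on two short hypothetical lists; the function name `pearson` and the data are illustrative only. It also squares one reported coefficient to express it as a share of linearly explained variance.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical regional and global sales for five games (not from the dataset).
regional = [0.1, 0.4, 0.2, 1.5, 0.8]
global_ = [0.3, 0.9, 0.5, 3.1, 1.6]
r_toy = pearson(regional, global_)

# Reported correlation between NA_Sales and Global_Sales, squared.
r_na_global = 0.9410474
r_squared_na_global = r_na_global ** 2
```

Squaring the reported 0.941 gives roughly 0.886. High correlations between the regional columns and Global_Sales are expected if the global figure is the sum of the regional ones, so they say little about independent relationships.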

Regression


Call:
lm(formula = reg_model, data = df)

Residuals:
   Min     1Q Median     3Q    Max 
-3.881 -0.460 -0.228  0.040 81.855 

Coefficients:
                   Estimate Std. Error t value Pr(>|t|)    
(Intercept)        1.249811   0.505163   2.474 0.013368 *  
Platform3DO        0.161739   0.929686   0.174 0.861890    
Platform3DS        0.951928   0.311311   3.058 0.002233 ** 
PlatformDC         0.214438   0.370940   0.578 0.563208    
PlatformDS         0.762373   0.303782   2.510 0.012096 *  
PlatformGB         2.071385   0.332082   6.238 4.55e-10 ***
PlatformGBA        0.383973   0.307153   1.250 0.211279    
PlatformGC         0.379821   0.309319   1.228 0.219492    
PlatformGEN        0.634250   0.433739   1.462 0.143682    
PlatformGG        -1.207191   1.561318  -0.773 0.439423    
PlatformN64        0.448814   0.319296   1.406 0.159851    
PlatformNES        0.992377   0.318640   3.114 0.001846 ** 
PlatformNG         0.020617   0.539697   0.038 0.969528    
PlatformPC         0.615821   0.305141   2.018 0.043591 *  
PlatformPCFX      -0.396993   1.546651  -0.257 0.797430    
PlatformPS         0.415351   0.309081   1.344 0.179022    
PlatformPS2        0.712810   0.303171   2.351 0.018725 *  
PlatformPS3        1.104892   0.305286   3.619 0.000296 ***
PlatformPS4        1.458823   0.318949   4.574 4.82e-06 ***
PlatformPSP        0.599494   0.305113   1.965 0.049451 *  
PlatformPSV        0.707332   0.314620   2.248 0.024576 *  
PlatformSAT        0.044508   0.330190   0.135 0.892776    
PlatformSCD        0.184523   0.701922   0.263 0.792645    
PlatformSNES       0.629542   0.325118   1.936 0.052842 .  
PlatformTG16       0.098382   1.117970   0.088 0.929877    
PlatformWii        1.088975   0.304971   3.571 0.000357 ***
PlatformWiiU       0.987758   0.330632   2.987 0.002817 ** 
PlatformWS         0.116715   0.689615   0.169 0.865605    
PlatformX360       1.123607   0.305068   3.683 0.000231 ***
PlatformXB         0.337398   0.307048   1.099 0.271853    
PlatformXOne       1.249837   0.324541   3.851 0.000118 ***
GenreAdventure    -0.249163   0.050810  -4.904 9.49e-07 ***
GenreFighting     -0.008183   0.059491  -0.138 0.890602    
GenreMisc         -0.073047   0.045772  -1.596 0.110535    
GenrePlatform      0.327900   0.058639   5.592 2.28e-08 ***
GenrePuzzle       -0.182459   0.069896  -2.610 0.009051 ** 
GenreRacing        0.027550   0.051216   0.538 0.590641    
GenreRole-Playing  0.096753   0.047926   2.019 0.043522 *  
GenreShooter       0.221926   0.050301   4.412 1.03e-05 ***
GenreSimulation   -0.064418   0.059149  -1.089 0.276135    
GenreSports       -0.011782   0.041720  -0.282 0.777641    
GenreStrategy     -0.230773   0.065637  -3.516 0.000439 ***
Year1981          -0.532073   0.551734  -0.964 0.334877    
Year1982          -0.508151   0.564133  -0.901 0.367727    
Year1983          -0.691531   0.633955  -1.091 0.275367    
Year1984           1.394957   0.711143   1.962 0.049829 *  
Year1985           1.673500   0.703472   2.379 0.017375 *  
Year1986          -0.522631   0.668201  -0.782 0.434139    
Year1987          -0.578625   0.661127  -0.875 0.381472    
Year1988           0.885906   0.688760   1.286 0.198380    
Year1989           1.538178   0.676190   2.275 0.022932 *  
Year1990           0.707488   0.691589   1.023 0.306328    
Year1991          -1.488911   0.631422  -2.358 0.018384 *  
Year1992          -0.330520   0.633423  -0.522 0.601817    
Year1993          -1.164026   0.626196  -1.859 0.063061 .  
Year1994          -1.129463   0.608291  -1.857 0.063360 .  
Year1995          -1.254575   0.599727  -2.092 0.036462 *  
Year1996          -0.919571   0.597644  -1.539 0.123906    
Year1997          -0.989417   0.597454  -1.656 0.097730 .  
Year1998          -0.997435   0.595940  -1.674 0.094205 .  
Year1999          -0.973807   0.596659  -1.632 0.102678    
Year2000          -1.242474   0.594119  -2.091 0.036517 *  
Year2001          -1.136030   0.591425  -1.921 0.054770 .  
Year2002          -1.293703   0.590676  -2.190 0.028523 *  
Year2003          -1.319081   0.590907  -2.232 0.025609 *  
Year2004          -1.232927   0.590819  -2.087 0.036921 *  
Year2005          -1.342564   0.590058  -2.275 0.022900 *  
Year2006          -1.441833   0.589796  -2.445 0.014510 *  
Year2007          -1.558979   0.589615  -2.644 0.008199 ** 
Year2008          -1.617913   0.589406  -2.745 0.006058 ** 
Year2009          -1.635988   0.589482  -2.775 0.005521 ** 
Year2010          -1.633260   0.589682  -2.770 0.005617 ** 
Year2011          -1.693170   0.589898  -2.870 0.004106 ** 
Year2012          -1.600906   0.591582  -2.706 0.006814 ** 
Year2013          -1.542091   0.592405  -2.603 0.009247 ** 
Year2014          -1.689756   0.592844  -2.850 0.004374 ** 
Year2015          -1.875667   0.593258  -3.162 0.001572 ** 
Year2016          -2.144004   0.596194  -3.596 0.000324 ***
Year2017          -2.255475   1.055928  -2.136 0.032693 *  
Year2020          -1.657767   1.623782  -1.021 0.307303    
YearN/A           -1.618747   0.585397  -2.765 0.005695 ** 
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Residual standard error: 1.513 on 16517 degrees of freedom
Multiple R-squared:  0.0584,    Adjusted R-squared:  0.05384 
F-statistic:  12.8 on 80 and 16517 DF,  p-value: < 2.2e-16
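
To make the coefficient table easier to read, the Python sketch below does two things with the printed numbers. First, it builds a fitted value by adding the intercept and one dummy coefficient per factor; this assumes the response is Global_Sales (the formula `reg_model` is not shown) and that the baseline levels are the ones absent from the table. The chosen combination (Wii, Sports, 2006) is just an example. Second, it recomputes the adjusted R-squared and the overall F statistic from the multiple R-squared, using n = 16517 + 80 + 1 observations.

```python
# Coefficients copied from the regression output above.
intercept = 1.249811
coef_platform_wii = 1.088975
coef_genre_sports = -0.011782
coef_year_2006 = -1.441833

# Fitted value for a Wii sports title from 2006: intercept plus one
# coefficient per factor (baseline levels contribute zero).
fitted_wii_sports_2006 = (
    intercept + coef_platform_wii + coef_genre_sports + coef_year_2006
)

# Model summary values copied from the output.
r_squared = 0.0584
k = 80              # number of estimated slopes (numerator df of the F test)
resid_df = 16517    # residual degrees of freedom
n = resid_df + k + 1

# Adjusted R-squared penalizes R-squared for the number of predictors.
adj_r_squared = 1 - (1 - r_squared) * (n - 1) / resid_df

# Overall F statistic expressed through R-squared.
f_stat = (r_squared / k) / ((1 - r_squared) / resid_df)

print(f"fitted={fitted_wii_sports_2006:.3f}  "
      f"adj R^2={adj_r_squared:.5f}  F={f_stat:.1f}")
```

The recomputed values match the reported adjusted R-squared of 0.05384 and F of 12.8. With an R-squared near 0.058, Platform, Genre, and Year together leave about 94% of the variation in the response unexplained, even though many individual coefficients are statistically significant.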

Plots

Relationship between Genre and Global Sales

Video Game Sales Dashboard

Genre Distribution

Sales By Region

Top Publishers

Platform Distribution

Global Sales Over Time

Average Rank By Platform

Total Sales by Platform

Interactive Table